MOD-GAUSSIAN CONVERGENCE: NEW LIMIT 
THEOREMS IN PROBABILITY AND NUMBER THEORY 
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Abstract. We introduce a new type of convergence in probability the- 
^ I ory, which we call "mod-Gaussian convergence". It is directly inspired 

^^ . by theorems and conjectures, in random matrix theory and number the- 

' ory, concerning moments of values of characteristic polynomials or zeta 

>0 , functions. We study this type of convergence in detail in the framework 

of infinitely divisible distributions, and exhibit some unconditional oc- 
currences in number theory, in particular for families of L- functions over 

Li^ , function fields in the Katz-Sarnak framework. A similar phenomenon 

h-^ ' of "mod-Poisson convergence" turns out to also appear in the classical 

^^ ' Erdos-Kac Theorem. 

^" 
-)— > 

S: 

1. Introduction 

\^J I Characteristic polynomials of random matrices are essential objects in 

Q>^ ■ Random Matrix Theory, and have also come to play a crucial role in the 

cr^ , remarkable results and conjectures linking random matrices with the study 

r^ I of L-functions in number theory (see e.g. |14j for recent surveys of this 

. ■ connection) . Our present work finds its source in the study of the asymptotic 

^^ I of moments of characteristic polynomials of random unitary matrices by 

QQ ' Keating and Snaith [10], and the corresponding conjecture for the moments 

f^ . of the Riemann zeta function on the critical line. More precisely, Keating 

and Snaith proved (in probabilistic language) that if (Yn), for A^ > 1, is a 
sequence of complex random variables, with Y/v distributed like det{I — Xjy) 
for some random variable Xjsf taking values in the unitary group U{N) and 
C^ [ uniformly distributed on U{N) (i.e., distributed according to Haar measure), 

then for any complex number A with Re(A) > —1, we have 



^ 



lim ^TjE 
N-^oo N^ 



'^^' \= G(1 + 2A)' ^'-'^ 

where G is the Barnes (double gamma) function. 

Then, using a (now classical) random matrix analogy, they make the 
following conjecture for the moments of the Riemann zeta function (see 
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ID],[I1!): for any complex number A with Re(A) > —1, we should have 

cQ + itj dt = M{X)A{X) (1.2) 



lim 



T^oo(logT)^'rjo 

where M (A) is the random matrix factor, suggested by (jl.ip . namely 
while A (A) is the arithmetic factor defined by the Euler product 

where, as usual, p runs over prime numbers, and the product is here abso- 
lutely and locally uniformly convergent; see Section [4.11 for details. 

We now look at (II. ip slightly differently. If we take A = iu to be purely 
imaginary in (jl.ip . then we obtain a limit theorem involving the character- 
istic function (or Fourier transform) of the random variables Zp^ = log JItv P 
(note that |Z7v| 7^ almost surely): 

lim e-^iogJVjE[e»^iv]= jim ^^'^o^n ^^^^ulo^\Y^\^^ ^ {G{1 + iu)f 
N^oo ^ ^ N-^00 ^ ^ G{l + 2iu) 

(1.5) 

A renormalized convergence of the characteristic function as it occurs in 
(II. 5p is not standard in probability theory. However, it has now appeared 
in various places in random matrix theory and number theory, although (to 
the best of our knowledge) always under the form of the convergence of 
normalized Mellin transforms as in (jl.ip and (jl.2p . 

In probability theory, the characteristic function E(e™'^) is a more natural 
object to consider, because contrary to Mellin or Laplace transforms, it 
always exists and characterizes the distribution of a random variable Z. 
Hence, in order to look more deeply into the properties of this type of limiting 
behavior for a sequence {Z^) of real-valued random variables, we use the 
characteristic functions (the Fourier transforms of the laws), and introduce 
the following definition: 

Definition 1.1. The sequence {Zn) is said to converge in the mod-Gaussian 
sense if the convergence 

holds for all li G M, where Pn £ ^ and 7iv > are two sequences and <I> is 
a complex-valued function which is continuous at (note that necessarily 
$(0) = 1). We call {Pn,7n) the parameters, and $ the associated limiting 
function. 

The main aim of this paper is to provide a general framework in which 
convergence such as (II. 6p occurs naturally. Secondary goals are to give 
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examples, both in probability and number theory, and to argue for the 
interest of this notion. As a first example, of course, (jl.Sp shows that the 
random variables log liVp converge in mod-Gaussian sense with parameters 
(0, 2 log N) and limiting function G(l + iu)'^G{l + 2iu)~^. 

The paper is organized as follows. In Section [21 we properly define the 
"mod-Gaussian convergence" , give some immediate properties and describe 
some easy examples where it occurs. In Section [3] we show that, under 
some conditions on the third moment, mod-Gaussian convergence occurs 
for sums of Cesaro means of triangular arrays of independent random vari- 
ables. Within this framework, a characterization of the limiting function 
$ is found in the case of infinitely divisible distributions, and it is shown 
to have a representation of Levy-Khintchine type, with one extra term. 
More generally, we also give a necessary and sufficient condition for the 
mod-Gaussian convergence to hold when the laws of the (^at) are infinitely 
divisible, with an explicit expression for the limit function <I> (which again 
has a Levy-Khintchine type representation, with two extra terms now). 

In Section U we give examples of mod-Gaussian convergence in number 
theory, in two directions. First, we show that the arithmetic factor ^(A) in 
the moment conjecture, for A = iu, arises as limiting function <I>(u) for the 
mod-Gaussian convergence of very natural sequences of random variables, 
and hence so does M{iu)A{iu); this is in particular additional (though mod- 
est) evidence in favor of the conjecture (|1.2p . since if that were not the case, 
the conjecture would necessarily be false. Second, we explain how Deligne's 
Equidistribution Theorem and the Katz-Sarnak philosophy lead to the proof 
of an analogue of the moment conjecture for families of L- functions over 
function fields; this second problem was raised in particular by B. Conrey. 
We think that these facts illustrate that the philosophy of mod-Gaussian 
convergence is a potentially crucial analytic framework underlying deep is- 
sues of number theory. In addition, we interpret the classical Erdos-Kac 
theorem in a similar way, although with "mod-Poisson" convergence. 

Remark 1.2. Because of the possible relevance of this paper both to prob- 
ability theory and number theory, we have tried to write it in a balanced 
manner, so that experts in either field can understand it. This means, in 
particular, that we recall precisely some facts which, for one field at least, 
are entirely standard and well-known (e.g., facts about infinitely divisible 
distributions, or zeta functions of curves over finite fields). This also means 
that, even though we are aware of the possibility of extending our results to 
sharper statements, we have not done so when this would, in our opinion, 
obscure the main ideas for one half of our readers. 

Notation. We use the notation 

'^(f), j f{x)dv{x), J fixMdx) 
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interchangeably for the integral of a function / with respect to some measure 
V. We write, as usual in probability, xf\y for min(x, y). In number-theoretic 
contexts, p always refers to a prime number, and sums and products over p 
(with extra conditions) are over primes satisfying those conditions. 

Acknowledgments. We thank M. Yor for a number of interesting dis- 
cussions related to this project. 

A.N. was partially supported by SNF Schweizerischer Nationalfonds Pro- 
jekte Nr. 200021 119970/1. 

2. General properties of mod-Gaussian convergence 

We start with the definition of a slightly stronger form of mod-Gaussian 
convergence: 

Definition 2.1. The sequence {Zj^j) is said to strongly converge in the mod- 
Gaussian sense if the convergence in (jl.6p holds uniformly in n, on every 
compact subset of M. 



The left side of (jl.6p being continuous, the strong convergence implies 
that <I> is continuous, hence the mere convergence. 

For the mod-Gaussian convergence with parameters (3^ = In = 0, the 
convergence and the strong convergence are the same, and amount to the 
convergence in law of the variables Z^ (this is basically Levy's Theorem) . 

2.1. Formal properties. It is natural to first ask for the intuitive meaning 
of mod-Gaussian convergence (and for explanation of the chosen terminol- 
ogy). The following proposition describes what might be called "regular" 
mod-Gaussian convergence: 

Proposition 2.2. Let (X^) be a sequence of real random variables converg- 
ing in law to a limiting variable with characteristic function <I>. If for each 
N we let 

Zn = Xn -\- Gn, (2.1) 

where Gn is a Gaussian random variable independent of X^, and with mean 
Pn o-nd variance jn, then we have the strong mod- Gaussian convergence of 
the sequence (Z^), with limiting function <I> and parameters (Pnj^n). 

Proof. Because Aat and Gn are independent, we have 

by the formula for the characteristic function of a Gaussian random variable 
(see, e.g., P (16.2)]). 

The convergence in law of (Atv) implies the local uniform convergence of 
the characteristic function of A^r to $ (the easy half of the Levy Criterion) , 
hence the convergence (11. 6p holds locally uniformly in n. D 
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We see that under this scheme, the variable Z^v is decomposed into two 
terms: a variable Xn, which converges in law, and a Gaussian variable, with 
arbitrary variance and mean, which can be viewed as a "noise" added to the 
converging variables. We then think intuitively of looking at Z^v modulo 
the subset of Gaussian random variables, and then only the convergent se- 
quence remains. It is this way of producing the convergence introduced in 
Definition 11.11 which motivated the terminology "mod-Gaussian". 

Proposition 12.21 does not cover all cases of mod-Gaussian convergence: as 
we will see later, the limiting function $ of a sequence converging in the 
mod-Gaussian sense may not be a characteristic function. However, the 
intuitive picture of some converging "core" hidden by possibly wilder and 
wilder noise may still be useful. 

The next proposition summarizes a few basic properties of mod-Gaussian 
convergence that follow easily from the definition (the first part, in particu- 
lar, is another justification for the terminology). 

Proposition 2.3. (1) Let {Z^) he a sequence of real-valued random vari- 
ables for which mod-Gaussian convergence holds with parameters {Pn,^n) 
and limiting function <1>. Then the mod-Gaussian convergence holds for some 
other parameters (/Jyv'TTv) '^'^^ limiting function^ <^' , if and only if the limits 

/3= hm {(3n-13'n). 7= lim (7^-7^), (2-2) 

N-^+00 A"— >+oo 

exist in M. In this case <!>' is given by 

$'(n) = e*^"""'^/2$(u), (2.3) 

and if the strong convergence holds with the parameters {Pn,^n) it o-lso holds 
with {Pn,i'n)- 

(2) Let {Z]\f) and (Z'j^) be two sequences of random variable with mod- 
Gaussian convergence (resp. strong convergence) , with respective parameters 
{Pnt'Jn) o.nd (/37V)77v)' ^^^ limiting functions $ and $'. If Z^^ and Z'^ 
are independent for all N , then the sums {Z^ + Z'jq) satisfy mod-Gaussian 
convergence (resp. strong convergence) with limiting function the product 
$<!>' and parameters {(3^ + (3']\j,^n + 7Ar)- 

Proof. (2) follows from the multiplicativity of the characteristic functions of 
independent variables. 

As for (1), if ()2.2p holds the mod-Gaussian convergence with parame- 
ters (/37V)7Ar) ^^^ limiting function <!>' given by (12. 3p is obvious from the 
definition, as well as the last claim (it is also a special case of (2)). 

Conversely, suppose that the mod-Gaussian convergence holds with pa- 
rameters (/3^,77v) and limiting function $1. Then 



Here, $' is not the derivative of $. 
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The function $ and ^' are both continuous and equal to 1 at 0, so they 
are non-vanishing on a neighborhood [—6, 6] of 0, with 6 > 0. Then for any 
non-zero u £ [—6,6], by taking the ratio and the modulus in (j2.4p . we get 



lim e^yN-^NW/2^ 

N-^+oo 

hence the second part of (|2.2|) holds with 



<^(u) 



>0, 



2 , 

7 = ^ log 



u^ 



$'(n) 



(this limit does not depend on the choice of u). Moreover, by (j2.4p again, 

$'(n) 
for all u £ [—6, 6], and the first part of (j2.2p follows. D 

Remark 2.4. It is important to notice that the mod-Gaussian convergence 
does not require the parameter sequences /Sat and 7Ar to converge. However, 
it implies the uniqueness of the parameters {^NjIn), up to a convergent 
sequence (this is what (1) above says). 

In the most usual situations, <!> will be smooth and E(e™ '^) also, and 
comparing expansions to second order for the left-hand side and right-hand 
side of (jl.6p around u = 0, one finds that, in this situation, the following 
will hold: 

(1) When varying the parameters (using (j2.3p . there is a unique possible 
limiting function ^q such that 

$o(u) = l + o(u2), foru^O (2.5) 

(2) For this limiting function ^q, up to adding sequences converging to 0, 
we have 

However, note that in natural situations, it is by no means clear if the 
limiting function satisfies (j2.5p . for instance for (jl.Sp . It may also not be the 
most natural choice (see Proposition 12. 2p . 

Remark 2.5. Observe that (1) in Proposition 12.31 would fail, should we drop 
the requirement of continuity of <I> at 0. For example if {Z]\f) converges in 
the mod-Gaussian sense with parameters {(3n,^n), with 77V — > 00, and if 
we take (^at — > 00 with < 6^ < jn, then (jl.6p holds with the parameters 
[PniIn — 6n) as well, and the associated limiting function vanishes outside 
0. 

2.2. Remarks, questions and problems. The introduction of mod-Gaus- 
sian convergence suggests quite a few questions which it would be interesting 
to answer, to deepen the understanding of the meaning of this type of limit 
behavior of sequences of random variables. 
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We first remark that, from tlie point of view of Proposition 12.21 it is 
also natural to introduce convergence modulo other particular classes of 
random variables: given a family T = (/^a)agA of probability distributions 
parametrized by some set A, such that the Fourier transforms 

JR 
are non-zero for all n G M, one would say that a sequence of random variables 
(Ztv) converges in the mod-^ sense if, for some sequence Xn £ A, we have 

lim fL{XN, u)-^ E(e^"^'^) = <^{u) 

for all M G M, the limiting function $ being continuous at 0. Weak and 
strong convergence can be defined accordingly. A particularly natural idea 
that comes to mind is to look at the family of symmetric stable variables 
with index a G (0, 2), and parametrized by 7 G A = (0, +00), so that 

/i(^,n)=e-^l"l". 

Such examples for a ^ 2 have not (yet) been observed "in the wild" , but 
we will see in Section [4. 3l that classical results of analytic number theory can 
be interpreted as an instance of "mod-Poisson" convergence, i.e., with J^ the 
family of Poisson distributions on the integers with parameter A G (0, +00). 

Another natural generalization is the fairly obvious notion of multi-dimen- 
sional mod-Gaussian convergence for random vectors, or indeed for stochas- 
tic processes. The finite-dimensional case may be used, in random-matrix 
context, to interpret results on moments of products of characteristic poly- 
nomials evaluated at different points of the unit circle. 

Now here are some obvious questions: 

(1) Can one find a convenient criterion for mod-Gaussian convergence to 
be of the "regular" type of Proposition 12.21 .'' Is there a "weak convergence" 
description of mod-Gaussian convergence using test functions of some type? 

(2) Is the idea useful in analysis also? Here one could think that the 
analogue of the "regular" mod-Gaussian convergence would be to have a 
sequence of distributions (T/v) satisfying 

Tn = Qn -^ Sn 

where -k is the convolution product, qm is the distribution associated to a 
Schwartz function of the type exjp{iu/3N — u^jn/2), and Sn is some con- 
vergent sequence of distributions. For instance, is it possible that some 
approximation schemes for solutions of some kind of equations converge in 
the mod-Gaussian sense? Is there, then, a more direct way to recover Sn 
(numerically, for instance) than by performing an inverse Fourier transform? 
In other words, can the Gaussian noise g^ be "filtered out" naturally? 

(3) Is there a convenient criterion for a function <I> defined on R, with 
$(0) = 1 and $ continuous at 0, to be a limiting function for mod-Gaussian 
convergence, similar to Bochner's Theorem (a function (p : M — > C is a 
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characteristic function of a probability measure if and only if 93 is continuous, 
ip{0) = 1, and 99 is a positive-definite function)? 

3. Limit theorems with mod-Gaussian behavior 

In this section, we provide several theorems characterizing situations in 
which mod-Gaussian convergence, as introduced in Definition II . H holds. 

3.1. The central limit theorem for mod-Gaussian convergence. We 

start with a result which provides a very general criterion for mod-Gaussian 
convergence in the framework of the classical limit theorems of probability 
theory. This suggests that mod-Gaussian convergence is a "higher order" 
analogue of the classical convergence in distribution. 

We recall first the standard limit theorems in the setting of triangular 
arrays of independent identically distributed (in short, i.i.d.) random vari- 
ables. 

Let (^"), for n > 1 and 1 < i < n, be random variables, where the 
variables 

^1 ) ■ ■ ■ )^n 

in each row are i.i.d. with law denoted by /x„. For any integer n > 1, let 

Sn = Xi -|- . . . -|- X^ 

denote the sum of the n-th row. 

If the X^ have expectation zero and variance 1, the Law of Large Numbers 
states that 5„/n converges in probability to as n ^ -|-oo, and the Central 
Limit Theorem states that the random variables 5„/y^, which are centered 
with variance 1, converge in law to the standard Gaussian variable M{0, 1). 

The latter can be interpreted as a "second order" type of behavior of 
Sn/n, beyond the "first order" convergence to 0, by rescaling by ^/n to 
obtain variance 1. Instead of this classical normalization, we want to look 
for finer information by normalizing with a growing variance. 

Working with Sn directly does not seem to lead to fruitful results, but for 
A^ > 1, we can consider the logarithmic mean of the 5„, i.e., the random 
variables 

N ^ N 



E- = E-(^" + ---+^") (3.1) 

^ — ' n. ^ — ^ n. 



n ■' — ' n 

n=l n=l 



which have variance given by the A^-th harmonic number H]\[, i.e., we have 

^ 1 
^ n 

n=l 

Note that it is well-known that, for a numerical sequence (ti„) that con- 
verges to a limit a, the analogue logarithmic means 

1 ^ 
1 sr^Un 

vn = > — 

log Jy ^ n 
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also converge to a (see, e.g., [20, III. 9]). This shows that, intuitively, the 
Zat can "amplify" the sums Sat by a logarithmic factor. 

We now show that, under quite general conditions, the Z^r converge in 
the mod-Gaussian sense. 

Theorem 3.1. Let (^r)in>i ^^ ^ triangular array of random variables, all 
independent, and such that the variables in the nth row have the same law 
fin; <ind let assume that ^„ has mean zero, variance 1 and third absolute 
moment satisfying 

oo „ 

7 — ^ < oo, where rUn = / \x\ fin(dx). (3-2) 

Then the logarithmic means Zj^ defined by (|3.ip strongly converge in the 
mod-Gaussian sense, with parameters (0, Hj^), or with parameters (0, log N) . 

For the proof, we recall the following useful lemma. 

Lemma 3.2 ([3], Proposition 8.44, p. 180). If X is a random variable with 
E[|X| ] < cxD, then the characteristic function (j) of X has the expansion: 

where 6{u) is a function of u that satisfies liuiu^od^u) = and \S{u)\ < 
3E{\X\^) for all u. 

Proof of Theorem \3.1\ Let (pniu) be the characteristic function of /i„, and 
let 

G^(u) = e"'^^/2E[e^''^^], 
which is continuous in u. It then suffices to show that Gat converges lo- 
cally uniformly to a limiting function. By the independence assumption and 
standard properties of characteristic functions, we have 

N 

GN{u)=e''"''^/'l[M^/nr. 

n=l 

Let j4 > be fixed. Applying Lemma 13.21 with k = 2 and using the fact 
that the /i„'s are centered with variance 1, we obtain that for |m| < A and 
n > 2A, we have 

\(pniu/n) - 1| < 2nV?^^ < 1/2. 
Taking log to be the principal branch of the logarithm (which is zero at 
1) on the disk {z £ C : \z - 1\ < 1/2}, we have for iV > M > 2yl and 
\u\ < A: 

Gjv(n) = GM(u)e^'^^'^(") (3.3) 

where 

Hm,n{u)= ^ n { log (pniu / n) + —^\ . (3.4) 

n=Af-|-l ^ ^ 
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Another application of Lemma [3 . 2 1 with k = 3 combined with the inequal- 
ity I log(l + z) - z\ < 4|zp for l^l < 1/2, yields for n> M: 

■2 ^3 16^4 



u 






It now follows from the assumption (j3.2p that for any fixed M > 2 A, 
Hm,n{u) is the partial sum (in A^) of a series starting at M + 1, which 
converges uniformly in ti G [— yl, A] to a limit HM{y), as iV — > cxd. Con- 
sequently, we deduce from (jS.Sp that Gn{u) converges, as A^ — > oo, to 
Gm{u) expHN{u), uniformly in u e [— A, ^4]. Since A is arbitrarily large, 
the result follows. D 

As an easy consequence of Theorem 13.11 one obtains the following central 
limit theorem for the sum of Cesaro means as introduced in Theorem 13.11 

Corollary 3.3. With the assumptions and notations of Theorem \3.1\ the 
re-scaled random Cesaro means Zn /^J\ogN converge in law to the standard 
Gaussian law M{0, 1). 

Proof. With the notation G^ of the previous proof, the characteristic func- 
tion of Zjy/y/logN is 

Now G]\f converges locally uniformly to a function equal to 1 at 0, hence 
GN{u/\^logN) — > 1, whereas we have Hj\f ~ logA^. So the result follows 
from Levy's theorem. D 

Example 3.4. The observation of the following example, arising from Ran- 
dom Matrix Theory, was the source and motivation for the considerations 
in this and the next sections. 

Consider a sequence (7„) of independent random variables, with 7„ having 
a gamma distribution with scale parameter 1 and index n, that is with the 
density YIn)^^~^^~^ '^R+{x)- We are interested in the behavior of 

TV 

Z^ = y^-Ar. 
^-^ n 

Recalling that one can represent 7„ as the sum 7„ = X^"=i^j", where the 
Y^ for i = 1, • • • ,n are i.i.d. with gamma distribution with index 1 (or, 
"exponential distribution"), we see that Zj^ is associated by (|3.ip with the 
variables A" = Y^ — 1, which have mean and variance 1 and a finite third 
moment (so (13. 2p holds). Hence we obtain the limit formula: 



lim e^'-^'^/^jE 



N 



In 



^^^\^^\Yl^r^ 




$(u) (3.5) 



for some continuous function <l>(n) with $(0) = 1. 
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Now it turns out that this hmit can also be computed exphcitly (as was 
first done in [15 , Th. 1-2]), as a consequence of a decomposition of the 
probabiUty law of the characteristic polynomial of random unitary matrices 
(distributed according to Haar measure) as product of independent gamma 
and beta random variables. Indeed, it was shown that for any complex 
number z with Re(z) > —1, we have 



lim ^^E 



«^p "ME?^ 



n 




A^exp - G(l + z) (3.6) 



where A = a — and G is the Barnes function. 

V 27r 

When z = —iu, the left-hand side is just the left-hand side of (|3.5p . so this 
argument gives a formula for <l>(n). More precisely, a representation of the 
Barnes function obtained in \15\ Proposition 2.3], shows that the limiting 
function ^(u) in ()3.5p is equal to 

exp (-— + r ^— — f e*"^ -l-iux + ^^^] dx] . (3.7) 

^V 2 Jo x(2sinh(f))2 V 2 J J ^ ' 

This formula, for a probabilist, is very strongly reminiscent of the Levy- 
Khintchine representation for the characteristic function of infinitely divisi- 
ble distributions. Since the gamma variables are themselves infinitely divisi- 
ble, it is natural to wonder whether this example has natural generalizations 
to other such random variables. We will now see that this is indeed the case. 

3.2. Infinitely divisible distributions. In this section, we will refine The- 
orem [S?!] by finding an expression for the limiting function ^(u). The con- 
text in which we do this is that of infinitely divisible distributions. Since 
readers with number-theoretic background, in particular, may not be famil- 
iar with this theory, we first recall some basic facts, referring to the standard 
textbooks [3] and [T7] for proofs and further details. 

Definition 3.5 ([17], p. 31). Denote by ^"* the n-fold convolution of a 
probability measure ;U with itself. The probability measure ;U on R is said 
to be infinitely divisible if for any positive integer n, there is a probability 
measure fin on M such that fi = fi^*. 

Theorem 3.6 ([E], Theorem 8.1, p. 37). The following properties hold: 

(1) // fj, is an infinitely divisible distribution on M, then for u £ M, we 
have 



/oo 
e*"^^(dx) = exp 
-oo 



--au + i(3u 



+ / (e*"^ - 1 - iux\\^\<i) v{dx) 



(3.8) 
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where a > 0, (3 £ R. and v is a measure on M, called the Levy measure, 
satisfying 

u{{0}) =0 and / (x^ A l) u{dx) < oo. (3.9) 

Jr 

(2) The representation of 'jl{u) in (13.8P by a, (3 and v is unique. 

(3) Conversely, if a > 0, /3 £ M and v is a measure satisfying (j3.9p . 
then there exists an infinitely divisible distribution /i whose characteristic 
function is given by (]3.8p . 

The parameters {a,(3,v) are called the generating triplet of fi. 

Remark 3.7. If we compare (|3.7p with (j3.8p . we see that the formula (j3.7p 
arising from the mod-Gaussian convergence in Example 13.41 is not an actual 
Levy-Khintchine formula, because the function x^ A 1 is not integrable with 
respect to the measure ■r(2sinMx/2')')^ (although |xp A 1 is); this explains why 
an additional second order term is required in the integrand. 

Remark 3.8. The theorem above is the usual representation of the Fourier 
transform of an infinitely divisible probability distribution. There are many 
other ways of getting an integrable integrand with respect to the Levy mea- 
sure v, and this will be important for us (see the discussion in [17', p. 38]). 
Let hhea truncation function, that is a real function on M, bounded, with 
conpact support, and such that h{x) = x on a neighborhood of 0. Then for 
every n S R, x i— > (e*"^ — 1 — uh{x)) is integrable with respect to i^, and 
()3.8p may be rewritten as: 

/2(u) = exp --au"^ + i/3/jU + / (e*"^ - 1 - iuh{x)) v{dx) (3.10) 

where 

/•oo 

f3h=P+ / {h{x) - xlM<:i)u{dx). 



The triplet {a,l3hTt^) is called the generating triplet of /U with respect to 
the truncation function h. 

One can also express the moments of /u in terms of the Levy measure 
z^. This is dealt with in Section 25 (p. 159 onwards) in [17j . For example 
one can show that J^\x\fi{dx) < oo if and only if f,,^^\x\i'{dx) < oo. 
More generally, the measure ^ admits a moment of order n if and only if 
i|x|>i \x\^^idx) < oo; in this case, the cumulants (c^) are related to the 
moments of i' as follows: 

ci = / xfi{dx) = J + / xv{dx), 

Jr J\x\>i 

/• /»oo 

C2 = X fj.{dx) — Ci = a + X u{dx), 

Jr J-oo 

/■oo 

Cfc = / ^ v{dx), for 3 < A; < n. 
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In particular, if /x admits a first moment, then Ji^i^i |x|z^((ix) < oo and 
we can write: 



Jj-iu) = exp 



-au^ + iciu + / (e*"""' - 1 - iux) v{dx) 



(3.11) 



3.3. Mod-Gaussian convergence in the case of infinitely divisible 
variables. Motivated by Example 13.41 we now consider the setting of Sec- 
tion [3?T] when the random variables forming the triangular array {Xf) are 
infinitely divisible. All other assumptions (concerning expectation, variance, 
third moment) remain in force; recall that /i„ is the law of the n-th row of 
the array and (pn its characteristic function. 

We will see that, in fact, when the probability measures ^n are infinitely 
divisible, we can give an explicit representation of the limiting function $ 
of Theorem 13. II in terms of the generating triplets for the measures iXn- 

Indeed, using the results recalled in subsection 13. 2^ it is easily seen that 
in the current situation, we can write 



where 



and 



(t)n{u) = exp(^„(ii)), (3.12) 

(T iP r°° 

Mu) = -^+ (e*"" - 1 - mrr) z.„(dx), (3.13) 



oo 



/oo 
\xfl^nidx) < OO, (3.14) 

-oo 

^— / \x f Un{dx) < OO. (3.15) 

J —oo 



n=l 



We now find a generalization of (j3.7p in the present situation. 

Theorem 3.9. In the setting of Theorem \3 . 1\. assume further that the proba- 
bility measures fin ore infinitely divisible and satisfy the conditions (j3.12p up 
to ()3.15p . Then the sequence (Z^) strongly converges in the mod-Gaussian 
sense, with the parameters {0,Hn) and limiting function <I> = e*, where 

^i>{u) = r ("e^"^ -l-iux+ '^-^\ u{dx) 

and 

oo 

u = ^nu'^, (3.16) 

n=l 

and the measures u'^ are defined by 

POO 

y'niA) = / \A{x/n)vn{dx), 



for any Borel set A. Consequently, v is a positive measure which satisfies 
z^({0}) = and J^ \x\'^i'(dx) < oo. 
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Proof. With the notations of the proof of Theorem [STU and combining (I3.3|) . 
dMD and (f3l2]) - (f3l5]) we have: 

N „oo ^ 2 2 \ 

Ho,Niu) = ^n ( e^"^ - 1 - iux + ^ ) ly'Jdx) (3.17) 

n=i -^-^ ^ ^ ^ 

and the result of the theorem fohows immediately, with 

oo 



x\ iy{dx) = y^— I Ixl'"^ Unidx) 

n=l 



D 



3.4. A criterion for mod-Gaussian convergence. We now want to prove 
a general result characterizing the existence of mod-Gaussian convergence of 
a sequence {Z]\f) such that the random variables Z^ are infinitely divisible. 
This is the analogue of the classical criterion for convergence in distribution 
of infinitely divisible random variables due to Gnedenko and Kolmogorov 
(see for example [T71 Th. 8.7]). 

Theorem 3.10. Let {Zi^) he a sequence of real-valued random variables 
whose respective laws fiN ^'"e infinitely divisible, with generating triplets 
{(JN-,bN,UN) relative to a fixed continuous truncation function h. 

Then we have mod-Gaussian strong convergence if and only if the follow- 
ing two conditions hold: 

(1) The sequence kn = J^ h{x)'^VN{dx) converges to a finite limit k; 

(2) There exists a nonnegative measure v satisfying 

i/({0}) = 0, / (x^ A \)v{dx) < +00 

Jr 

and such that fNif) -^ ^(/) f^'^ ^''^U continuous function f with \f{x)\ < 
C(x^ A 1) for X G M and some constant C > 0. 

Under these conditions, one may take the parameters 

(3N = bN, IN = CTN + I^Nih^), (3.18) 

and the limiting function is then ^ = exp(^), where 

*(u) = -^— K + / ( e*""^ - 1 - tuh{x) + ^ + i ^ J u{dx). 

(3.19) 

Proof. The left-hand side of (|1.6p can be written, in our case, as exp(^jv(n)) 

where 

^2 f . 

^7v(n) = iu{bN - Pn) - ^{ctn- In) + / (e*""' - 1 - iuh{x))uN{dx) 

^ Jr 

(3.20) 

= iu{bN - Pn) - Tr(o"Af + J^N{h^) - In) - i-^^N + VNiK)-, (3.21) 
2 
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where 

ku{x) = e™^ - 1 - iuh{x) + ^ + t ^. (3.22) 

2 

(a) First assume that Conditions (1) and (2) above hold, and define Pn 
and 7Ar by ()3.18p . Since h is bounded with compact support and h{x) = x in 
a neighborhood of 0, it is easily seen that |fc„(x)| < Cu{x'^ A 1), where Cu is a 
constant depending on u. It thus follows from ()3.21|) . (13.180 and Conditions 
(1) and (2) that ^Ar(n) -^ ^(u), as defined by (I3.19p . Thus, it only remains 
to prove that we have VN^ku) -^ z^(^u) uniformly (with respect to u) on all 
compact sets X C M. For this, we use fairly standard arguments. 

Let K he a compact subset of M, and let e > be fixed. For any A > 1, 
let 

'O if -\x\ < A 



9a{x) 



^ - 1 if A < |x| < 2^ 



1 if \x\ > A, 

and hA = I — QA- The function qa is continuous with < gf^ < x^ A 1, and 
QA ^ pointwise as A ^ oo. Therefore we have the limits 

lim i'n{gA) = v{gA), lim /^(^a) = 0. (3.23) 

Now write 

\T^n{ku) - l^{ku)\ < WnigAku)\ + WigAku)\ + Wn{hAku) " l^{hAku)\. 

There exists a constant Ck > 0, depending only on K, such that 

\kuix)gA{x)\ < CxgAix) 

for all u E M, x G M. Consequently, combining the two limits in (j3.23p . we 
see that there exists Nq > 1 and i? > such that, for any A^ > Nq, we have 

\yn{gBku)\ + \i^{gBku)\<2e, (3.24) 

uniformly for n G M. 

The other term, where now ^ = i? is fixed, is also easily dealt with. 
First, kuhs is continuous and satisfies \ku{x)hB{x)\ < Cu{x^ A 1), for some 
Cu depending on n, so that by assumption we have 

lim i^Nikuhs) = J^ikuhB), 
for any u € M. In addition, for each fixed x, the map 

u 1-^ ku{x)hB{x) 

is differentiable and its derivative is bounded by Dk{x^ A 1) for x G M 
and u G K, the constant Dk depending only on K. Consequently, the 
function u h^ i^Nikuhs) is differentiable on M, and its derivative is uniformly 
bounded on the compact set K. It follows that the family of functions {u i— > 
VN{kuhB))N is equicontinuous on K, and therefore its pointwise convergence 
to i'{kuhB) is uniform on K. Thus for some A^i, we have 

Wnihsku) - v{hBku)\ < e 
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for all N > Ni and u £ K. From this and the previous estimate ()3.24p . the 
uniform convergence on compact sets follows. 

(b) Conversely, we assume that (jl.6p holds, locally uniformly in u, with 
some parameters {Pn,1n) and limiting function $. Since ^ is continuous 
and non-vanishing on some neighborhood / = {—26, 25) of and $(0) = 1, it 
follows from basic results of complex analysis (see, e.g., [T7l Lemmas 7.6 and 
7.7]) that on / we have $ = exp(^), where ^ is continuous with ^(0) = 0, 
and moreover ^n, as defined by (13.2ip . converges to ^ uniformly on /. 

We deduce that, if is a locally bounded function on R, we have 

An{u) = / {^N{u + y)-'^N{u))'&{y)dy 



-^ A{u) = / {'^{u + y)-'^{u))i){y)dy (3.25) 

for all u such that \u\ < 5. 

Now, we observe that the orthogonality properties 

Hy)ydy = [ ^{y)y^dy = O for ^y) = -^ {by^ - 3^2), (3.26) 



s 



105 



together with Fubini's Theorem allow us to eliminate the terms involving 
h{x), UN and b^ in (|3.20p . to yield 



An{u) = / e'^'^g{x)vN{dx), (3.27) 



where 



g{x) = [ ^{yW^y-l)dy 
J-5 

6'^ /86^ i6^sm{6x) 206cos{5x) 20sin(fe) 

105 \ 3 X x^ x^ 

The function g is continuous and, as checked by straightforward calculus, 
satisfies 

g{x) r^ x^ as X ^ 0, C^^{x'^ Al) < g{x) < Cix"^ Al) Vx G M, (3.28) 

for some constant C > (in particular, g{x) = if and only if x = 0). Note 
that A]\f is the Fourier transform of the positive finite measure ii]\f{dx) = 
g{x)i']\f{dx), and the convergence (|3.25p for all u with |n| < 5 implies (see 
for example the proof of Theorem 19.1 in [8j) that the sequence of measures 
//AT is relatively compact for the weak convergence (or, tight). 

In particular, there is a subsequence {fJ-N^) which converges weakly to a 
positive finite measure /i. Then obviously Condition (2) is satisfied by the 
sequence (i^Nk), with the limiting measure u defined by 

''(dx) = ——lM.\{o}Kdx). 
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Next, we check that Condition (1) is satisfied by the subsequence (katj.)- 
To this end, we observe that the imaginary part of ^at (u), as given by ()3.2ip . 
can be written as follows: 

Im(^jv(w)) = 'u(&7V — Pn) + / (sin(ux) — uh{x))vN{dx) 

Jm. 



u^ 



where 



u{bN - Pn) - — i^n + vn{ 
b 



"u\x) = sin(n3;j — uh[x) + 



6 

Since ^ n{u) -^ ^{u)^ we have Im(^Ar(n)) — > Im(^(n)) for all u £ I. The 
function i^ is continuous and also satisfies |^«(x)| < Cu{x^ A 1), for some 
constant Cu > 0, and hence UNki^u) -^ t^{^u)- Consequently, 

u 
u{hNk - pNk) - y ^^fe -" Im(^(«)) - 2^(4) 

as A: ^ +CXD, and for u £ I. Then obviously Im(^(n)) — v{lu) = au + bv? 
for some a, 6 G M, and it follows that k^^^ -^ k = 6b. This proves Condition 
(1) for {knJ. 

To conclude the proof, we apply the sufficient condition (a) of our theorem 
to the subsequence (Zn^.), which implies that $ = e*, where ^ is given by 
(I3.19p . Therefore $ does not vanish, and henceforth we can take / = M 
in the previous proof. We deduce that the convergence ()3.25p holds for all 
ti G M, and then Levy's Theorem yields that not only is the sequence (^Af) 
tight, but it actually converges to a limit fi. Therefore the previous proof 
holds for the original sequences {i^n) and (kn), and we are done. D 

We now state as a corollary a weak limit theorem for variables satisfying 
the assumptions of Theorem 13. lOi Of course, (1) below is a classical result. 

Corollary 3.11. Let Zj^ he a sequence of infinitely divisible random vari- 
ables satisfying one of the equivalent conditions of Theorem \3.1(A with gen- 
erating triplets (0, 0,z/Ar). Then we have: 

(1) If i']\f{h'^) -^ a £ [0, +oo[, then the sequence {Zn) converges in law 
to a limit random variable Z , which is necessarily infinitely divisible, with 
generating triplet (0,(T, z^) with v as in (|3.19p . but in this case x^ A 1 is 
integrable with respect to v, and a = a — v{h?). 

(2) If VN{h'^) -^ +CXD, then Zj\f/y^iJiy(hF) converges in law to the standard 
Gaussian random variable M{0, 1). 

Proof. The results follow from the fact that under our assumptions, we have 
locally uniformly for n G M. D 
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4. Some examples of mod-Gaussian convergence in arithmetic 

In this section, which is largely independent of the previous one, we give 
two examples of (unconditional) instances of mod-Gaussian convergence in 
analytic number theory. The first is quite elementary and formal. For the 
second (involving function fields), we again summarize briefly the required 
information to understand the statements, this time for probabilist readers. 
In addition. Section [4.31 explains how to interpret the Erdos-Kac theorem in 
terms of mod-Poisson convergence. 

4.1. The arithmetic factor in the moment conjecture for C,{\/2 + it). 

We come back to the moment conjecture (II. 2p for the Riemann zeta function, 
which we recall: we should have 

lim ——^-—T2 I \C{\ + it)\'^^dt = A{\)M(\) 
T^+ooT(logr)^' 7o \ > y I 

for any complex number A such that Re(A) > — 1, where 

M{\) = — — — — , G{z) the Barnes double-gamma function, (4.1) 

-w-n(-^)^"{E(^)Vi. (...) 

p ^ m>0 ^ ' 

Prom p.5p . it follows that the random matrix factor M{iu), for li € M, 
occurs as the limiting function for the mod-Gaussian convergence of some 
natural sequence of random variables (namely, characteristic polynomials 
of random unitary matrices of growing size distributed according to Haar 
measure) . It is thus natural to wonder whether the arithmetic factor has the 
same property, since, if that were the case, the formal properties of mod- 
Gaussian convergence (specifically, (3) in Proposition 12. 3p imply that there 
exists a sequence of random variables which converges in mod-Gaussian 
sense with limiting function A{iu)M{iu). We will show that this is the case 
(indeed with strong mod-Gaussian convergence); we believe this structure 
of the moments has arithmetic significance, although the computation we 
do now (though enlightening) does not yet explain this. 

Proposition 4.1. There exists a sequence {Zj^) of positive real-valued ran- 
dom variables and positive real numbers jm > such that 

locally uniformly for u G M. 

Proof. We start by writing A(iu) as a limit 

A{iu) = lim Ai{u,N)A2{u,N) 
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where 

A,iu,N)= J](l-p-l)-«^ (4.3) 

p<N 

Mu,N) = n E (^^S^)V" = n ^mu,iu,up-'). (4.4) 

p<Nm>0 ' ^ ' p<N 

by the definition of the Gauss hypergeometric function 

^, ^ , ^^a{a + l)---{a + k-l)b{b + l)---{b + k-l)z'' 
,F,{a,b;c;z) = }^^ ^(^ ^ 1) . . . (^ ^ ^ _ 1) ^- 

We now recah that 

JJ(l_p-i)^e-7(iogAr)-i 

p<N 

as N ^ +00, by the Mertens formula (see, e.g., [7, Th. 429]). Thus, it is 
natural to consider 

7^ = 2(7 + log log A^) >0, A^>2, 

because we then have 

lim e'''^^/'^A2(u,N)=A(iu). 

It is then enough to show that, for any N >2, the factor A2{u, N) is the 
characteristic function E(e*" '^) of a random variable Z]\[ to deduce 

lim e''^^^/'^E(e'''^^) = A(iu). 

Furthermore, since A2{u, N) is defined as a product, it is enough to show 
that each hypergeometric factor 2Fi{iu, iu; l;p~^), p prime, is the character- 
istic function of a random variable Xp to obtain the desired result with Zjsf 
the sum of independent variables distributed as Xp for p < N. Lemma 14.21 
below, applied with x = p, checks that this is the case. Then, finally, the 
convergence is locally uniform because so is the convergence of the Euler 
product defining A{iu). D 

Lemma 4.2. Let X be a complex-valued random variable uniformly dis- 
tributed over the unit circle, and let x be a real number with x > 1. Then 
we have 

^^^uilog |i-x-V2x|-2)^ = e(|1 - x-^/^^r^-) = 2Fi(m, iu; 1; x"^). 
Proof. With X as described, we have 



X 
1 ^ 



^^l_2Re(X) 



X \/X 
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which is always > (1 — x^^'^)^ > 0. Since Re(X) is distributed hke cosO, 
where is uniformly distributed on [0, 27r], we have 

E(e™'°s |i-x- V2x|-2^ ^J_ f ^ ^i^^-i _ 22;-i/2 cos 0)-™d0. 

27r Jo 

Now it is enough to apply [6, 9.112] (with n = 0, p = iu, z = x^^''^) to 
see that this expression is exactly 2Fi{iu, iu; 1; x~^). D 

Remark 4.3. In view of (11.20 and the Euler product (formal) expansion 



pl/2+it 
f 

the mod-Gaussian convergence of the arithmetic factor is described con- 
cretely as follows: let {Xp) be a sequence of independent random variables 
identically and uniformly distributed on the unit circle. Then the sequence 
of random variables defined by 



Xr, 



Xp 



5:iogi-^ =iogni f-P 

p<N Vi^ p<N V^ 

converges as N ^ -|-cxd, in the mod-Gaussian sense, with limiting function 
given by the arithmetic factor for the moments of |C(2 + ^i)P in (|1.2p . eval- 
uated at iu, and parameters (0, 21og(e'^log A^)). 

Remark 4.4. The computations above are implicit in some earlier work, for 
instance in the paper [4] of Conrey, Farmer, Keating, Rubinstein and Snaith 
on refined conjectures for moments of L-functions. 

4.2. Some families of L-functions over function fields. In this section, 
we give an example of mod-Gaussian convergence in the setting of families 
of L-functions, as developed by Katz and Sarnak [9]. We do not try to 
summarize the most general context in which they operate, in order to keep 
prerequisites from algebraic geometry to a minimum, concentrating on one 
concrete example which is already of great interest and can be explained 
"from scratch". It is the family of hyperelliptic curves, which is described 
in P §10.1.18, 10.8]. 

The fundamental result linking families of L-functions and Random Ma- 
trix Theory is the equidistribution theorem of Deligne, which we phrase (in 
the special case under consideration) in more probabilistic language than 
usual to clarify its meaning for probabilists. Its content is then that some 
sequences of random variables, defined arithmetically and taking values in 
the set of conjugacy classes in compact Lie groups such as U{N), converge 
in law to the image on the space of conjugacy classes of the probability 
Haar measure on the group. Consequently, the values of the characteristic 
polynomials of such random variables are approximately distributed like the 
variables Z]\f in the Keating-Snaith limit formula (II. ip for suitable values of 
N (or their analogues for other groups). 
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Let p be an odd prime number and let g = p", n > 1, be a power of p. 
We denote by Fg a field with q elements, in particular Fp = TLjpTL. Recall 
from the theory of finite fields that if we fix an algebraic closure F^ of F^, 
then for every n > 1 there exists a unique subfield Fgn of Fg which has order 
g"" (i.e., it is a field extension of degree n of Fg), which is characterized as 
the set of X G Fg such that x'^ = x. 

Let g > 1 be an integer, and let / G Fg[T] be a monic polynomial of degree 
2g + 1 with no repeated roots (in an algebraic closure ¥q of F^). Then the 
set Cf of solutions, in F^, of the polynomial equation 

Cf : y"^ = f{x) = x^^+^ + a2gx'^3 + ■ ■ ■ + aix + ao, (say). 

is called an affine hyperelliptic curve of genus g. Taking the associated 
homogeneous equation in projective coordinates [x : y : z], one gets the 
projective curve 

(5/ : y^z^^-^ = f{xz-^)z'^3+i = ^23+1 ^ a2gzx^3 + . . . + aiz'^^^ + a^z'^^+i ^ 

which is still smooth and corresponds to Cf with an added point at infinity 
with projective coordinates [0:1:0]. 

For every n > 1, denote by Cf{¥qn) the set of points in Cf which have 
coordinates in the subfield F^n of Fg (note that Cf (Fgn ) ~ Cf (F^n ) U { [0 : 
1 : 0}). The L- function Pf{T) of Cf (sometimes called the L-function of Cf 
instead) is then defined as the numerator of the zeta function Z{Cf) defined 
by the formal power series expansion 

n>l n>l 

which is known to represent a rational function of the form 

'^^^^^"(i-r)(i-gr)' 

which determines uniquely the L-function Pf. 

The following properties of Pf were all proved by 1945, in particular 
thanks to the work of A. Weil on the Riemann Hypothesis for curves over 
finite fields: 

- Pf is a. polynomial with integer coefficients of degree 2g, with Pf{0) = 1. 

- [Functional equation (F.K. Schmidt)] We have the polynomial identity 

- [Riemann Hypothesis (A. Weil)] If we write 

PfiT)= n (l-«/J^)' «/,J-eC, (4.5) 

l<j<2g 

then all the inverse roots Ofj satisfy | afj \ = y^. 
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The following property, which Weil could already prove in some form, is 
much better understood in the framework of algebraic geometry as developed 
in the 1960's by the Grothendieck school: 

- [Spectral interpretation] There exists a well-defined conjugacy class Ff in 
the set U{2g, C)^ of conjugacy classes in the compact unitary group U{2g, C) 
such that 

Pf{T) = det(l - q^/^TFf) (4.6) 

(this conjugacy class is the unitarized geometric Frobenius conjugacy class 
of Cf)- Moreover, there exists a non-degenerate alternating form (•, •) on C^^ 
such that Ff is a conjugacy class in USp{2g, {■, •)), the unitary symplectic 
group of matrices which leave the alternating form invariant. Because any 
two non-degenerate alternating forms are conjugate over C, we can (and 
will) see Ff as a well-defined conjugacy class in USp{2g,C), the unitary 
symplectic group for the standard symplectic form. 

Remark 4.5. If one makes the substitution T = q~^, s G C, to define a 
complex-variable L-function 

Lif,s)=Pfiq-^), (4.7) 

the functional equation and Riemann Hypothesis become exact analogues 
of the corresponding property and conjecture for the Riemann zeta function 
and its zeros, namely 

and all zeros of L(/, s) have real part equal to ^. 

The spectral interpretation, on the other hand, which implies that zeros 
of the L-function are eigenvalues of a unitary matrix (defined only up to 
conjugation), is much more mysterious for the Riemann zeta function. Note 
that the fact that Ff is in fact a symplectic conjugacy class implies the 
functional equation, by simple linear algebra. 

Here is now one particular case of the Deligne Equidistribution Theorem. 

Theorem 4.6 (Katz-Sarnak). Fix an integer g > 1. For every power q of 
an odd prime p, let TCg^q be the set of monic polynomials in Fq[T] of degree 
2g+l which have no multiple roots. Let Hgq be random variables with values 
in USp{2g, C)^ and with distributions given by 

F{Hg,q = C) = p^|{/ G ng,q I Ff = C}\ (4.8) 

for any conjugacy class C G USp{2g,C)K 

Then, as q ^ +oo, among odd powers of primes, the random variables 
Hg^q converge in law to a random variable Hg distributed according to 

F{Hg eA)= fig{A), (4.9) 

for any conjugacy-invariant measurable set A in USp{2g, C), where fig is the 
probability Haar measure on USp{2g, C) . In other words, Hg is distributed 
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according to the image on the space of conjugacy classes of the probability 
Haar measure on USp{2g,C). 

Proof. This is exactly Theorem 10.8.2 of Katz and Sarnak [9jp taking the 
choice of a^ to be y|fc[ so that the conjugacy class denoted 'd{k,ak,Cf/k) 
is the same as the class Fj for / G T~(-g,\k\ (see the discussion in 10.7.2 of [9]), 
and the distribution (j4.8p of Hg^g is the same as the measure fJ-{hyp, 2g + 
l,g,¥q,^) as defined in 10.8.1 of [9j. □ 

Remark 4.7. This result is derived by an application of Deligne's Equidistri- 
bution Theorem. Deligne's theorem is much more general: in fact, for any 
"algebraic" family of L- functions (a much more general notion that what we 
have described) there is always an equidistribution theorem which can be in- 
terpreted as convergence in law of random variables defined similarly to ()4.8p 
for some conjugacy classes associated with the family, to a random variable 
distributed according to the image of the probability Haar measure on a 
group which can be interpreted as "the smallest group for which equidistri- 
bution may conceivably hold" (see [9l §9.2, 9.3, 9.7] for detailed discussions 
of various versions). Much of the work in applying Deligne's Equidistri- 
bution Theorem is concentrated in the determination of this group (often 
called the geometric monodromy group of the family). For the case of the 
family given by the Ti-g^g in Theorem 14.61 the content of the result is that 
this monodromy group is the whole symplectic group Sp{2g), and this is 
proved in [9, Th. 10.1.18.3]. 

The following proposition is then immediate. 

Proposition 4.8. Let g > 1 be an integer. Let Hg^g be random variables as 
in Theorem \4.6\ For any A G C with Re(A) > 0, we have 

1 



li? 1^5^ ^A2-]o7^]E(det(l-F,,,)^) = M5p(A), 

g— >-|-cxD g— >+oo gv-^ +-^J/^ 



_;,2/2^7r^A/2 G(3/2) 



where 

In particular, for any integer k > 1, we have 

k 

Proof. For Re (A) > 0, the function 

X 1-^ det(l — x) 



Except that there is a typo in their statement, namely in the right-hand side of 
line 13, ^(mfrin, g, fci, a^. ) should be replaced by jj,{hyp,d,g,ki,aki) and line 14 can be 
deleted. 
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is continuous and bounded on USp{2g^<C)^; this is because, in terms of the 
eigenvalues e'^^ of x G USp{2g,C) arranged in pairs so that 02g+i-j = — %, 
we havqj 

det(l-a;)= J| |(l-e^^OP>0. 
i<i<9 
Keating and Snaith |11|, eq. (26) & (32)] have shown that 



hm 5-(^^+^)/2 E(det(l - H,)') = ,aV2 G(1 + A) Vr(l + A) 
9-+<« ^ ^ ^^ ' ^G(l + 2A)r(l + 2A) ^ ^ 

for any complex number A with Re(A) > — 1, and therefore the statement is 
a direct consequence of convergence in law of Hgg to Hg once we prove that 
our expression for Msp{X) coincides with this value. This, however, is easy 
to check: using the duplication formula for G{z) in the form 

G{^fG{2z) = {27r)-'2^'''^'+^r{z){G{z)G{z + l)f 

one can rewrite the function in the square-root of the denominator of the 
Keating-Snaith expression as 

r(l + 2A)G(1 + 2A) = G(2(l + A)) = (2^)-(^+i)22(^+i)'-2(^+i)+i 

/ G(l + A)G(3/2 + A) x2 

and deduce the result after a short computation. 

The last expression for Msp{k) is equation (34) in [llj and follows also 
from the definition of Msp{X) using G{z + 1) = T{z)G{z) and the formula 

^,, 1, (2fc-l)!! r- 

for A; > integer. D 

We can not argue quite so quickly to derive a mod-Gaussian convergence 
result because the random variables logdet(l — Hg^q) are not defined when- 
ever Hgq has an eigenvalue 1, and this can occur with positive probability. 
Indeed, by (|4.6p and (14. Sp . we have 



det(l-F^) = P^(g-V2)= -Q (i_g-i/2^^.) 

i<i<2<? 

for any / G "H^.g, and so the issue is whether ^ is a zero of Z{Cf) (equiv- 
alently, whether L(/, 1/2) = 0), and this may well happen (e.g., if q = p^ 
with p = 3 (mod 4), for the curve E : y"^ = x^ — x, it is well-known, and 
easy to show, that Pe{T) = (1 — pT)'^). 

Nevertheless, it is not too hard to prove the following: 



This is an analogue of the non-negativity of the central special value L{f, 1/2) for 
any self-dual L-function, which is implied by the Riemann Hypothesis. 
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Proposition 4.9. Let g > 1 be an integer. For any power q ^ 1 of an 
odd prime p, let Tig q he the subset of those f £ Tig g such that L{f, 1/2) = 
P;(g-V2) / 0. ' 

(1) Let Igq be random variables with values in USp{2g,C)* such that 

nig,, = C) = -^\{f € ng,g I Ff = C}\ 

for any C € USp{2g, C)K Then Ig^q converges in law to Hg as q ^- +oo. 

(2) Let Lg^q = logdet(/ — Ig^g) which is a well-defined real-valued random 
variable. We have the mod- Gaussian convergence 



lim lim 5-^"/2+«V2E(gmLs.,)=j\^ /^^)^ 

g— >+oo g^+oo 



for any u £ 



Note that we have here mod-Gaussian convergence with parameters given 
by (^ log g, log 5). 

Proof. (1) If 93 is a bounded continuous function on USp{2g,<Cy, we have 

I E{^ilg^g))-E{ip{Hg,q))\ < | | (^ | ^ P ( dct ( 1 " //<; ,y ) = O) = | | (^ | | OO ^(^g,^ G Ag), 

where Ag = {x £ USp{2g, C)^ \ det(l — x) = 0}. This set Ag is a closed set 
with empty interior, hence boundary equal to Ag, which has Haar measure 
zero. By Theorem 14.61 and the standard properties of convergence in law, 
we have 

lim F{Hg^g G Ag) = F{Hg E Ag) = 0, 
Q >--|-00 

and it follows then that 

hm EM4,))=EMF,J), 

g— >+oo 

which justifies the convergence in law of Igq. 

(2) For / G Tig^q, we have det(l — Ff) > 0, and therefore the definition of 
the law of Ig^g shows that Lg^q is well-defined. Because of the Keating-Snaith 
limit formula (j4.10p . valid for all complex numbers with Re(A) > —1, it is 
enough to show that, for all n G M, we have 

lim Eie'"'^^''') = E(det(l - F„)^"). 

g— >-)-oo 

The function 99 on USp(2g,C)'^ defined by 

fo, ifdet(l-x) = 

ldet(l — x)*", otherwise, 

is bounded and its set of points of discontinuity is the set Ag of Haar- 
measure 0. By a fairly standard result on convergence in law, this and the 
convergence in law of Ig^g to Hg suffice to ensure that 

lim E{^{Ig,q))=nipiHg)) 
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(see, e.g., [2l Ch. 4, §5, n°12, Prop. 22], properly translated, or one can 
of course do the necessary e management by hand). By definition, the left- 
hand side is E(e''"^'''''), while the right-hand side is E(det(l — -ffg)*") (since 
Ag has measure zero, once more). D 

Remark 4.10. Although we used the example of the family Tig^q in this sec- 
tion, it is clear from the proofs that the argument goes through with no 
change for any family with symplectic monodromy, and that suitable ana- 
logues will hold for families with unitary or (with a bit more care because of 
the issue of forced vanishing at the critical point) with orthogonal symmetry. 



Remark 4.11. In terms of L-functions as defined in (j4.7p . we can rephrase 
the last limit as follows: 

It remains a big problem to obtain results of this type without the inner 
limit over q, which already transforms the arithmetic to a pure "random 
matrix" problem by the magic of Deligne's equidistribution theorem (see 
the comments and conjectures in [9l p. 12, 13], in particular Example (2), 
p. 13). However, Faifman and Rudnick [5] and Kurlberg and Rudnick [13] 
have recently given examples of problems where it is possible to understand 
the limit g -^ +oo for fixed q. 

In this situation, by analogy with Section 14. H one may expect to have 
a mod-Gaussian limit theorem with an extra "arithmetic" factor. More 
precisely, one can make the following conjecture: 

Conjecture 4.12. For fixed odd q, we have 

u'^/2-iu/2 

hin \ Y. \LUM'^)r = Msr,{iu)An{iu) 

locally uniformly for u G M and also 

-A(A+l)/2 

hin ^ Y. I^(/, 1/2)1" = M5p(A)A,(A) 

g^+co \nna\ ~ 

for all real A > 0, where 

A(A+l)/2 



A(A) = n (l - FT 






the product extending over all irreducible monic polynomials it G ^^[T], and 
Jtt] = g'^'^sl'r) denoting the norm o/vr. 



MOD-GAUSSIAN CONVERGENCE 27 

The shape of arithmetic factors is motivated by the fact (essentially ob- 
served by Kurlberg and Rudnick) that for a fixed vr, the vr-factor of the Euler 
product representing L{f, 2) converges in law to 

(i-T.ivrr^^ri 

where T^ is a trinomial random variable taking value with probability jTrn 
and values ±1 with equal probability ^ ( 1 — ra+T ) ' ^^^ inner parenthesis on 

the right of the arithmetic factor is, of course, equal to ]E((1 — Tt^IttI^^''^)^'^) 
(as one sees after rearrangement of the latter). 

4.3. The number of prime divisors of an integer. A classical result 
of Erdos and Kac states (as a particular case) that the arithmetic function 
a;(n), the number of (distinct) prime divisors of a positive integer n > 1, 
behaves for large n like a Gaussian random variable with mean log log n and 
variance log log n, in the sense that 

l,r ,. , u;(n) -loglogiV ,,, If'' ,2/,, 

N^+ooN^^ - ' VloglogAf ^' V2^Ja 

(4.11) 
for any real numbers a < b. 

This phenomenon where increasing variance is observed suggests, in the 
context of this paper, to look at the behavior of iv{n) over n < N without 
normalizing. However, mod-Gaussian behavior is excluded here because of 
the following easy remark: 

Proposition 4.13. Let {Z^) be a sequence of integer valued random vari- 
ables. Then {Z^) does not converges in the mod- Gaussian sense with respect 
to any unbounded parameters (jn)- 

Proof. The point is that the characteristic functions 

E(e™^^) 

are 27r-periodic for all A^ if the Z^ take only integral values. If u = 2it (or 
any other non-zero integral multiple of 2tt) then the limit (jl.6p implies 

lim 6"^^/^ = 1^(1)1, 

Af^+00 

and (since 7Ar > 0) the existence of this limit implies that 'Jn converges to 
7 > 0. D 

However, it turns out that there is mod-Poisson convergence, in the sense 
sketched in Section 12. 2i For this, it seems slightly more appropriate to 
consider 

uj'{n)=uj{n)-l (4.12) 

for n > 2, because Poisson random variables takes all integral values > 0, 
whereas (jj{n) > 1 for any n > 2 (of course, M.llh is valid for iAj'{n) also). 
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To state the result precisely, recall that a Poisson random variable Pa 
with parameter A > is one taking (almost surely) integer values k > 
with 

P(^A = k) = -^e-\ 

The characteristic function is then given by 

E(e^"-P^) = exp(A(e™ - 1)), 

and strong mod-Poisson convergence of a sequence Zjsi of random variables 
with parameters Atv means that the limit 

lim exp(Ajv(l - e^"))E(e*"^^) = $(u) 

Af^+oo 

exists for every n G M, and the convergence is locally uniform. The limiting 
function $ is then continuous and $(0) = 1. 

Proposition 4.14. For n G M, let 

*(")=f(?^Ti)nO-^) (i+^)' w.«) 

This Euler product is absolutely and locally uniformly convergent. More- 
over, for any u G M, we have 

lim ^^°g^^^''^'"^ y e-'W = c^(n), 

2<n<N 

and the convergence is locally uniform. 
Proof. Since 

2<n<N l<n<N 

for N > 2, this is in fact a simple reinterpretation of a direct applica- 
tion of the Delange-Selberg method (see, e.g., [181 H-S, Theorem 3]) to the 
multiplicative function n i— > g™"^'"^. The details are explained in |18[ II. 6, 
Theorem 1] (take N = 0, z = e^^ , A = 1 there, and apply Ao(^;) = 1 to get 
the formula 

1 „ , .„, 1 -i-r / l\e' 



r(j»)0,(i;O = ^i-jn(i-^) {^ + —, 

for the limit, which is in the notation of loc. cit., with Gi{s;z) defined on 
the last line of p. 201 of pE|, its analytic continuation to Re(s) > 1/2 being 
described on p. 202). Multiplying by e~*", we obtain the stated result using 

gmp(gm) = r(e^" + 1). n 
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Corollary 4.15. Consider random variables M^, for N >2, such that 

F{Mn = n) = j^—j, 2<n<N, 

and let Z^ = oj'{Mn). Then the sequence (Z^) converges strongly in the 
mod-Poisson sense with limiting function $ given by d^. J3 ) and parameters 
Xn = log log A^. 

Proof. This follows directly from the proposition and the definition of strong 
mod-Poisson convergence, since we have E(e*" ^iv)^i = (log A^)^"^^^'"). D 

The analogue of Corollary 13.31 is then the theorem of Erdos and Kac: 

Corollary 4.16. The Gaussian limit (j4.1ip holds. 

Proof. With notation as in Corollary 14.151 we must show that 

^ _ cj(Mjv)-loglogJV 
Vlog log N 

converges in law to a standard Gaussian variable (with a shift from N to 
A^ + 1 which is immaterial). The argument is again quite standard, but we 
spell it out in detail. 

Let u G M be fixed; the characteristic function of Y/v is 

E^gmViv) = exp(-mVloglog A^) E(e^*'^(^^^)) 

= exp(-mv^loglogA^ + it) E(e^*'^'(*^^)) (4.14) 

where 

u 
t 



Vlog log A^ 
(note that t depends on A'" and t ^ when A^ -^ +c«). 

By Proposition 14.141 in particular the uniform convergence with respect 
to u, we have 

lim (log N)^-^'' E(e**'^'(^'^^)) = $(0) = 1. (4.15) 

Moreover, we have for A^ > 1 

(logA^)^"-^ = exp((e^* - 1) log log A^) 

= exp((it - t'^/2 + 0{t^)) log log A^) 

= exp(m0oglogAr-^ + o( "" )). (4.16) 

V 2 \ V log log A/ / / 

Writing (Ii4i|) as 
exp(-mVloglogA^ + it) X {log Nf'-^ x (log A^)!"^'* E(e^*'^'(^^^)), 
we see from (14.15P and (14.16P that this is 

and by Levy's criterion, this concludes the proof. D 
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In fact, this proof is essentially the one of Renyi and Turan [16j, who 
simply did not isolate Proposition 14.141 as a separate statement of interest 
(and proved directly the version of the Delange-Selberg needed in this case, 
see equation (1.31) in loc. cit.). 

Remark 4.17. It is also natural to use Poisson variables because the asymp- 
totic formula 

1 IJ ^ ^^l 'f ^ Ml 1 (log log JV)' 

holds as N -^ +co, for fixed k > (the uniformity with respect to k, as 
shown first by Sathe and Selberg, is a quite delicate issue, see, e.g., [18; II. 6, 
Theorem 4]), so that uj'{n) is again seen to be "approximately" Poisson with 
parameter log log N. 

Our focus here is on arithmetic behavior, but it is interesting to note a 
phenomenon similar (though apparently simpler) to what happens for the 
Moment Conjecture for C(l/2 + it), namely, that the limiting function <I>(n) 
takes the form of a product 

with 

*'<"' = r(5^TT)- *'^(«) = n(i-p) (i-p)(i + ?TT, 

and we can see the Euler product ^2 as an instance of a natural asymptotic 
probabilistic model of primes, while $1 comes from a seemingly unrelated 
model which has some group-theoretic origin. 

Indeed, for the first factor ^i{u) = r(e*" + 1)^^, appealing to the classical 
formula 

1 TT / e*"\/ l\-e'" 

f(^»TT)=n(i + T)(^+A:) 

^ ^ A;>1 

for any n G M (due to Euler; see |19^ 12.11]), we can compute this as follows: 

-i-r / 1\ 1-e™ / 1\ -1 / e*" 

*'(«) = „lim^n(i + j:) (i + l) (i + T 

k<N 

= lim exp(A.(l-0)n(ln)"(^ + 

k<N 

= lim exp(Aiv(l-e™))E(e*"^'^), 



where 



Xn= Yl log{l + k-')=log{N + l) 

l<k<N 

and Z]si is the sum 

Zj^ = Bi + B2 + • • • + Bj\[ , 
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with Bk denoting independent Bernoulli random variables with distribution 

F(Bk = 1) = 1 ^ = ^— , F(Bk = 0) = -^ = — ^. 

^ " ' 1 + i fc + l' ^ ^ ' 1 + i k + l 

This distribution is also found, for instance, as the distribution of i^Ar+i — 
1, where i^Ar+i is the number of cycles of a permutation a € ©at, distributed 
according to the uniform measure on the symmetric group (see, e.g., [U 
§4.6]; we observe in particular that this gives another instance of natural 
mod-Poisson convergence explaining some type of normal limit). So here we 
see random permutations occuring as explaining the "transcendental" factor 
$i(ti); in [12j, this phenomenon is explored in greater depth. 

For the arithmetic factor $2(^^)1 we argue (much as in Section [4. Ih that 

^,{u)= hin n(i--) (i--)(i+^ 

y^+00 J- J- V pj \ pj \ p—1 

p<y 

and by isolating the first term, it follows that 

$2(^i)= lim exp((l - e'")Aj,) n f 1 - - + -e' 

y-^+00 " ±A. \ p p 

p<y 

= lim E(e^"-^^0~^IE(e™^^) 

y—^+oo 

where 

-^2^ = X]^°S(1-P"^)""^ = ^2]^ ~ log logy, as y ^ +00, 

P<y p<y 

k>l 

and 

Zy = T.K (4.17) 

p<y 
is a sum of independent Bernoulli random variables with parameter 1/p: 

F{B'p = l) = -, P{b; = 0) = 1--. 
^ p f p 

The parameters of these Bernoulli laws correspond exactly to the "intu- 
itive" probability that an integer n be divisible by p, or in other words to 

the model limit 

1 v^ 1 

lim — > 1 = — , 

x^+oo X -^^ p 

n<x 
n=0 (modp) 

and the independence of the B' corresponds to the formal (algebraic) in- 
dependence of the divisiblity by distinct primes given, e.g., by the Chinese 
Remainder Theorem. 

Since the analogue of Proposition 12.31 (3), is trivially valid for mod- 
Poisson convergence, we also recover in this manner (without arithmetic) 
the fact that the limiting function ^{u) arises from mod-Poisson conver- 
gence. 
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As in the case of the Riemann zeta function, we note that the independent 
model fails to capture the truth on the distribution of u;(n), the extent of 
this failure being given by the factor <l>i(u). Because 



Zy -loglogy law 



AA(0,1) 



\/log log y 

(with the right-hand side being a standard normal random variable), this 
discrepancy between the independent model and the arithmetic truth is 
invisible at the level of the normalized convergence in distribution. 

Remark 4.18. Computations similar to the above show that, for any se- 
quence {xn) of positive real numbers with 

^ a;„ = -hoo, ^ a;^ < +00, (4.18) 

n>l n>l 

if we denote now by (-Bn) a sequence of independent Bernoulli random vari- 
ables with 

¥{Bn = 1) = Xn, ^Bn = 0) = 1 - X„, 

then the random variables 

Ztv = -Bi + • • • + Bn 
have mod-Poisson convergence with parameters 

Xn = Xl^ + Xn, 

and with limiting function given by 



[](l + x„(e»-l))exp(x„(l-e™)) 



u 1-^ 

n>l 

the (uniform) convergence of this infinite product is ensured by the second 
condition in ()4.18p . after expanding in terms of Xn (which tends to as 

n -^ -|-oo). 
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